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The THO complex is a nuclear 
structure whose architecture is 
conserved among all kingdoms and plays 
an important role in mRNP biogenesis 
connecting transcription elongation with 
mRNA maturation and export. Recent 
data indicates that the THO complex is 
necessary for the proper expression of 
some genes, assurance of genetic stability 
by preventing transcription-associated 
recombination. Yeast THO has been 
described as a heterotetramer (Tho2, 
Hprl, Mftl and Thp2) that performs 
several functions through the interaction 
with other proteins like Texl or the 
mRNA export factors Sub2 and Yral, 
with which it forms the TRanscription 
and EXport complex (TREX). In this 
article we review the cellular role of 
THO, which we show to be composed 
of five subunits with Texl being also an 
integral part of the complex. We also 
show a low-resolution structure of THO 
and localize some of its components. We 
discuss the consequences of THO inter- 
action with nucleic acids through the 
unfolded C-terminal region of Tho2, 
highlighting the importance of unfolded 
regions in eukaryotic proteins. Finally, we 
comment on THO recruitment to active 
chromatin, a role that is linked to mRNA 
biogenesis. 



Transcription of pre-mRNA, its matura- 
tion and mRNA export to the cytoplasm 
are a chain of processes involving coopera- 
tion of many proteins and assemblies. 
Over the years these processes have been 
intensively, albeit independently investi- 
gated. However, the acquired knowledge 



has led to a new vision of these processes as 
working in a coordinated manner. One of 
the examples of the connection between 
all the steps involved in RNA biogenesis 
is the role of the THO complex, thought 
to be formed by Tho2 (180 kDa), Hprl 
(90 kDa), Mftl (45 kDa), and Thp2 
(30 kDa) proteins. 1 During transcription 
elongation, the THO complex is involved 
in packing pre-mRNA molecules into 
RNA— protein assemblies termed mRNPs 2 
and is also essential for efficient co- 
transcriptional recruitment of mRNA 
export factors Yral and Sub2 3 (Fig. 1). 
Disruption of any of these tightly coupled 
steps of production of translation- 
competent mRNA in the cytoplasm leads 
to the activation of the RNA surveillance 
pathway and to the subsequent degrada- 
tion of non-active mRNA molecules. 4 
Lack of any of the THO subunits in yeast 
results in several molecular phenotypes 
like impairment of mRNP formation, that 
in turn leads to defects in transcription 
elongation and export. Lack of the Hprl 
subunit causes mRNA to remain trapped 
in the transcription sites giving rise to 
RNA/DNA hybrids (R-loops), which 
causes hyperrecombination and genomic 
instability. 5 Deletion of the Mtfl subunit 
promotes the accumulation of transcribed 
but not matured and exported RNA, 
which, together with transcriptionally 
active chromatin, pieces of RNA export 
machinery and nuclear pore complexes 
(NPC), forms large aggregates called heavy 
chromatin. 6 Besides, the expression of 
long genes containing internal repeats is 
markedly reduced in tho2 mutants. 7 

Not much is known about the structure 
of THO and even the composition is a 
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matter of debate. Two of its constituent 
proteins, Tho2 and Hprl, present high 
sequence similarity among yeasts, plants, 
insects and mammals. The rest of THO 
core subunits is less conserved but they 
have nevertheless been proposed to play 
similar roles. 8 Several proteins have been 
described to interact with the THO such 
as Mex67, Gbp2 and Hrbl (the serine- 
arginine-rich (SR)-like proteins) and the 
Prpl9 complex, with is involved in 
splicing and transcription elongation. 3,9 " 11 
However, a more stable and stronger 
interaction has been described between 
THO and three other proteins, the 
RNA helicase Sub2/UAP56, the RNA 
binding protein Yral/Aly, and Texl. The 
complex formed by THO and all these 
proteins has been termed TREX (from 
TRanscription and Export), 3 although the 
real subunit composition of both THO 
and TREX is still a matter of speculation. 
Texl was first described as a TREX 
component of unknown function which 
co-purified with THO core subunits in 



yeast, 3 although later was found to co- 
purify in stoichiometric amounts with 
THO isolated from Drosophila and 
human. 3 ' 12,13 This evidence led to 
postulate that Texl could be a bona fide 
THO component, while Sub2/UAP56 
and Yral/Aly may only interact transiently 
with it. 312,13 Determination of the three- 
dimensional structure of the THO com- 
plex should therefore shed additional 
light on the composition of the genuine 
THO complex. 

The Architecture 
of the THO Complex 

THO was purified using a TAP-tagged 
Tho2 14 and this consistently rendered a 
very pure and stable complex composed of 
stoichiometric amounts of the four pro- 
teins already described as THO compo- 
nents (Tho2, Hprl, Mftl and Thp2) and 
a fifth protein, Texl. This indicates that 
THO is a hetero-pentameric complex. 
The three-dimensional reconstruction of 



THO, 15 performed by electron microscopy . — 
and image processing, revealed a long and GO 
conspicuous structure with two lateral 
protrusions (Fig. 2). The resolution of the 
reconstruction (17 A) is not good enough to 
locate the five subunits within the envelope ^ 
of the complex, so other approaches were (0 
used to map some of the components. _J 

Labeling of THO with an anti-Hprl 
antibody that recognizes its C-terminal __ 
part allowed us to locate this region to one ( — } 
of the ends of the THO structure (Fig. 2). 
Hprl is the most evolutionary conserved 
and best functionally characterized THO 
subunit 16 " 19 and it has been found that its 
C-terminal region is ubiquitinated in a 
transcription-dependent manner and the 
ubiquitin moiety is recognized by the 
C-terminal UBA domain of the export 
receptor Mex67. Structural studies have 
indicated that ubiquitinated Hprl and the 
NPC subunits, FG nucleoporins (Nups), 
may bind to the Mex67 UBA in a 
mutually exclusive manner. 16 This suggests 
that the Mex67 export factor may be 
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Figure 2. Three-dimensional structure of THO, generated by electron microscopy and image processing. The THO complex (EMD-2053) comprises Tho2, 
Hpr1 : Mft1,Thp2 and Tex1 proteins. Different electron microscopy experiments have helped to localized Hpr1, Texl and the C-terminal region of Tho2, 
which is involved in nucleic acid binding [and which we show here in close relation with the RNA polymerase (EMD-1322)]. The localization of the rest 
of the subunits is speculative. 



recruited to the nascent transcript via 
ubiquitinated Hprl and then transferred 
to nuclear pore FG Nups. The localization 
of the C-terminal region of Hprl in a 
exposed area of the THO structure 
strengthens the notion of this area of 
Hprl as a protein-interacting region. 

An independent, three-dimensional 
reconstruction of the THO purified from 
a A Texl strain which generated a THO 
complex without Texl, allowed us to 
locate this protein in one of the protruding 
masses of the complex, and the atomic 
model of Texl, a 7-blade [3-propeller 
structure, docked very well in the corres- 
ponding mass of THO (Fig. 2). The 
identification of Texl as a core component 
of the THO complex is in accordance with 
some phenotypes described previously in 
plant orthologs. Since its discovery, the 
function of Texl has remained elusive. 
Deletion of Texl in yeast does not lead to 
any detectable growth phenotype and we 
only have observed a small decrease in 
production of exogenous (3-galactosidase 
mRNA, which is encoded by a long 



GC-reach gene particularly sensitive to 
THO complex. 15 In contrast to yeast, the 
plant Texl seems to play a crucial role. The 
Arabidopsis ortholog of Texl (AtTEXl) 
is important for biosynthesis of a subset 
of endogenous and transgene siRNAs 20 ' 21 
such us TAS1 and TAS2 tasiRNAs, IR71 
siRNAs, and PSuc2:PDS. 20 ' 21 AtTEXl is 
however not essential for biogenesis of 
all small RNAs since miRNAs and most 
cellular endogenous siRNAs, including 
those that are dependent on Pol IV, are 
not affected by mutations in the AtTEXl 
locus. 3 Not surprisingly, in addition to 
AtTexl, mutations in other THO compo- 
nents such as AtTHOl and AtTH06, also 
affect tasiRNAs biogenesis. Proteins with 
WD40 folds are known to be involved 

22 

in protein-protein interaction so it is 
possible that AtTEXl plays a role in the 
interaction of THO with other proteins 
involved in the siRNA pathway. In yeast, 
where the RNAi machinery does not exist, 
Texl seems to play a less crucial role, 
although probably one which involves 
protein-protein interaction. 



The THO Complex is Recruited to 
Active Chromatin through a Basic 
Unfolded Tail with Can Interact 
Directly with Nucleic Acids 

A different labeling experiment was used 
to locate the C-terminal region of Tho2 at 
the tip of the other, thinner and longer 
protrusion 15 (Fig. 2). One of the main 
questions regarding the mechanism of 
THO-mediated mRNP biogenesis is 
how this complex is recruited to active 
chromatin. Two mechanisms have been 
proposed. The first one involves the 
mediation of other proteins in the recruit- 
ment, such as RNA polymerase II, which 
was already described to interact with 
THO, 23 or Syfl, a component of the 
Prpl9 splicing complex that has been 
proposed to be involved in THO complex 
recruitment. 9 The second hypothesis pro- 
poses that THO interacts directly with 
DNA and RNA in vitro. 3 ' 12 ' 13 We have 
found that THO interacts with both DNA 
and RNA by the unfolded C-terminal 
end of Tho2 protein. 15 Without this 
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C-terminal region THO complex is not 
able to bind nucleic acids in vitro, which 
in turn decreases the [3-galactosidase 
expression and slightly increases the level 
of recombination. When Syf 1 is mutated, 
only about 50% of TREX is bound to 
chromatin, but this amount is sufficient 
for proper mRNA export. 9 These two 
findings lead to the conclusion that 
binding of THO to nucleic acids relies 
on a combination of both mechanisms, 
direct and mediated. 

The properties of the Tho2 C-terminal 
region are somehow similar to the serine/ 
arginine-rich domains of SR splicing 
factors. 24 These act as alternative spli- 
cing factors by enhancing or inhibiting 
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